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Abstract - The aim of this paper is to propose a novel Voice On Demand (VoD) architecture and 
implementation of an efficient load sharing algorithm to achieve Quality of Service (QoS). This scheme 
reduces the transmission cost from the Centralized Multimedia Sever (CMS) to Proxy Servers (PS) by 
sharing the videos among the proxy servers of the Local Proxy Servers Group [LPSG] and among the 
neighboring LPSGs, which are interconnected in a ring fashion. This results in very low request 
rejection ratio, reduction in transmission time and cost, reduction of load on the CMS and high QoS 
for the users. Simulation results indicate acceptable initial startup latency, reduced transmission cost 
and time, load sharing among the proxy servers, among the LPSGs and between the CMS and the PS. 

Keywords: Local proxy servers group, load sharing. Proxy caching, VoD architecture. 

1. INTRODUCTION 



The tremendous growth of World Wide Web has resulted in an increase of bandwidth 
consumption throughout the internet. Proxy caching has been recognized as an effective technique 
to reduce network traffic. Caching is also an important mechanism for improving both the 
performance and operational cost of multimedia networks [10]. Recent web access patterns show 
frequent requests for a small number of popular objects at popular sites. So, a popular video can be 
transmitted to the same network link once per request. In the absence of caching, this approach 
results in server over load, network congestion, higher latency and even the possibility of rejection 
of the clients request. Caching proxy servers resolves the above issues by distributing the load 
across the network [3]. 

A VoD system usually has several servers and distributed clients over the entire network. 
These servers contain prerecorded videos and are streamed to the clients upon request from the 
clients. Proxy cache attempts to improve the performance of the overall network communication 
in three ways [9]. They are 
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i. Reduce the user perceived latency associated with obtaining documents as the proxy 
cache is normally close to the user. 

ii. Lower the network traffic as the documents are available to the user and hence reduces 
the load on the network 

iii. Reduce the service load 



In the modem era, a number of caching and buffering techniques have been proposed to 
reduce the startup latency and bandwidth demand between the CMS and PS. Most of these 
techniques use proxy servers with large storage space for caching videos which are frequently 
requested. Cached data is serves the future requests and the uncached portions of the video are 
downloaded from the centralized servers [2]. 

Proxy servers have been widely used for multimedia contents to decrease the startup delay 
and to reduce the load of the centralized multimedia server. Recent works explores the advantages 
of connected proxy servers within the same LAN [3] [5] [8]. 

2. RELATED WORK 



Authors, Tay and pang in [3] propose an algorithm called GWQ (Global waiting queue). This 
algorithm reduces the startup delay by sharing the load in a distributed loosely coupled VoD 
system by balancing the load between the lightly loaded proxy servers and heavily loaded proxy 
servers. They have rephcated the videos evenly in all the servers. Hence, the storage capacity of 
individual proxy server should be very large to store all the videos. However, we propose to store 
the portion of the video only once across the proxies of the LPSG to accumulate more number of 
video blocks. Authors, Sonia Gonzalez, Navarro and Zapata in [4] propose a more reahstic 
algorithm to share the load in a distributed VoD system. They suggest that their algorithm 
maintains a small startup delay using less storage capacity servers by allowing partial replication 
of the videos. They further store the locally requested videos in each server. Depending on the user 
requests, the popularity of the videos and percentage of replication is determined [2] and [7] 
considers the exchange of cached contents among the local proxy server without any coordinator. 
Our work differs in that we have made a group of proxy servers with a coordinator (Tracker) to 
share the videos among the proxy servers of LPSG more efficiently. 

In this paper, we propose an efficient load sharing algorithm and a new VoD architecture for 
distributed VoD system. This architecture consists of a centralized multimedia server which is 
connected to a set of trackers. Each tracker is in turn connected to a group of proxy servers and 
these proxy servers are assumed to be interconnected using ring pattern. This arrangement is 
called as LPSG. Each of such LPSG, which is connected to CMS, is in turn connected to its left 
and right neighboring LPSGs in a ring fashion through its tracker. 



Arranging the group of proxy servers in the form of LPSG provides several advantages: 
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• Request-service delay: Caching the most frequently asked videos across PSs of Lp and 
sharing these videos among the PSs of Lp can exploit the maximum bandwidth usage of 
the local area of networks. 

• Increased aggregate storage space: By distributing the large number of videos across the 
PSs and TR of LPSG, high cache hit rate can be achieved. For example, if 10 PSs within 
a Lp managed 500 Mbytes each, total space available is 5 Gbytes. 200 proxies of LPSG 
could store about 100 Gbytes of movies. 

• Load reduction: Distribution of large number of videos on multiple proxies in LPSG 
provides service to more number of clients by passing downloading of the complete 
video from the main multimedia server. 

• Scalability: By adding more number of PSs the capacity of the system can be expanded. 
Interconnected TRs increases the system throughput. 



The further organization of the paper is as follows: Section 3 analyzes various parameters 
used in the problem. In section 4 we present a proposed approach and algorithm in detail. In 
section 5 we present a simulation model. Section 6 presents the simulation results and discussion. 
Section 7 summarizes the paper and refers to flirther work. 

3. LOAD SHARING PROBLEM FORMULATION 

This section describes the most important parameters and model considered in the evaluation 
process. Simulation details are given in the next section. By analyzing proxy server traces and 
content distribution network statistics, it becomes possible that the web objects are requested 
according to a Zipf-like distribution. The zipf law was originally formulated in terms of popularity 
of words in a text. It states that the frequency of i-th object is 

k 1 

p =— i =1,2, Nwherek = = 

i ^gii 

Zipf law is generalized as follows: 

ia 

Where i assumes the value 0.986 for web object popularity. 
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Table 1: PARAMETERS IN THE MODEL 



Pcifci 


llf*fiTiitinn 
J^CillllllUlI 


C. 




N 


Number of proxy servers in the system 


N 


Number of videos of the LPSG 




Request rate for a specific movie in proxy 
server j(req/min) 


TCOST 


Transmission Cost for ({S;- W;}'''^'' 


Wti 


Ghent's waiting time for a video V; 


B 


Total size of blocks at each PS 


Wi 


Size of (p-l)i cached at PSq, q=l..M 



reference to 



a group of N 

Vi, V2, 

of 



size(duration in minutes), Si with mean arrival rates . . . X n respectively that are being streamed 
to the user using M proxy servers(PSs) PSi...PSm of J LPSGs Lp, where p=l . . J. Each proxy 
server has a caching buffer large enough to cache total B minutes of video and k number of 
videos, W minutes of each video Vi is Wi i=l..N is cached across the PSs of Lp. That is M*B = 
Ilf'Ll Wi and B=Zp=iWl. Let bi be the available bandwidth for Vi between the proxy cache and 
CMS. After requesting for a video Vj, the streaming of that video will be delayed by {TT(Si- 
Wi)™^bi + TT(Wi)''^}.let ?Li > ?L I for 1 < i < j < N , i.e., popularity of these videos decreases 
gradually with the index so that Vi and Vn be the most and the least popular video respectively, 
i.e. minimum number of blocks will be cached for Vi and large number of blocks will be cached 
for Vn- The total arrival rate for all videos is X =Z;''=i J'l. A user request a video Vi with the 
probability pi = Xi/X, i = 1 . . . N. Let the number of videos streamed from Lp be N, leading to 
average user waiting time of Wti(PSq) and average transmission cost TCOST( ) where Wti( ) and 
TCOST() are non-linear functions, i= 1 ... N. The optimization problem to maximize the 
streaming of number of blocks of videos from Lp by sharing the videos among the PSs of LPSG Lp 
so as to minimize the average transmission cost TCOST and user waiting time Wt for a video i at 
a PS q can be formulated as follows: 

Minimize If'LiCSi - Wi) 
Minimize TCOST"({Si- Wi}^^^ ), 
Minimize Wti(PSq) 

Subject to M*B = SiLiWi , B=Iflj.Wi andWi>0 



4. PROPOSED ARCHITECTURE AND ALGORITHM 
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Figure 1. Cooperative Proxy servers Architecture 



4.1 Architecture 

The proposed VoD architecture is as shown Fig. 1 . Our proposed architecture consists of a 
CMS, which is connected to a group of trackers (TRs) where each TR is in turn connected to a set 
of PSs. These PSs are connected among themselves in a ring fashion. And also to each of these 
PSs a large number of users are connected [LPSG]. All these LPSGs are interconnected through 
their TR in a ring pattern. The TR caches some portion of video Vi where i=l . . .N according to the 
frequency of requests for video Vi at any one PS of Lp only once. Then PS transmits cached data 
to the client through LAN using its less expensive bandwidth. The TR is also a PS with high 
computation power, which coordinates and maintains a database that contains the information of 
the videos present in each PS in that Lpand also free cache space at each PS. 

The CMS, the TR and the PSs of LPSG are assumed to be interconnected through high 
capacity optic fiber cables. In the beginning, all the videos are stored in the CMS. TR, caches the 
portion Wi of the video Vi at any one of the proxy PSq of Lp only once, as it streams the video Vi 
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from CMS to the client upon the request. It decides the Size (number of blocks (minutes)) of this 
portion Wj of Vj to be cached in proportion to its popularity i.e size(Wi) a pop(Vi). So, more 
number of blocks will be stored for most popular video. This setting drastically reduces the 
frequent access of CMS by serving maximum number of clients with maximum Wi from Lp. We 
assume that the frequency of requests to a video follows Zipf law of distribution and bandwidth bj 
will be available for (Sj- WO ™^ 

4.2 Optimal Load sharing Algoritlim 

In this section, we explain how our load sharing algorithm handles the user request of various 
scenarios as follows. 

Case 1 : If the Vreq is present in PSq, then the video will be sfreamed to the user immediately 
and hence the initial startup latency is very low, and TCOST ({Sr W,} is very 
low. 

Case 2; If the Vreq is not present in the PSq, then the same is intimated to the TR. The TR 
checks whether this Vreq is present in any of the PS in that Lp. If the Vreq is present in 
one of the PSs in that Lp, then the TR checks whether the PS in which the Vreq was 
found is neighbor to the requested PSq [NBR (PS,)]. If so, the TR(Lp) initiates the 
sfreaming of the Vreq from that NBR(PSq) to the requested PSq and the same is 
mtimated to the requested PSq and hence the Wt(Vreq) and TCOST({Si- W;}'^'^'^ ) is 
very small. 

Case 3: The V,;,q may not be present in the PSq and also it may not be present in the 
NBR(PSq) but may be present in the PS other than the NBR(PSq). Then the TR (Lp) 
intimates the same to PSq and initiates the sfreaming of the Vreq from this PS to the 
requested PSq through the optimal path found by the TR and hence the Wt (Vreq) and 
TCOST ({Si- Wi} ^^^'> is relatively higher, but acceptable with high QoS. 

Case 4: If the Vreq is not present in any of the PSs in that LPSG, then the TR (Lp) Passes the 
request to the tracker of NBR (Lp). That is TR (NBR (Lp)) checks whether the Vreq is 
present in any of the PS in its LPSG using perfect hashing. If the Vreq is present in 
Einy of the PSs in that LPSG, then the TR(NBR(Lp)) selects the optimal streaming 
path from the selected PS(NBR(Lp)) to the requested PSq and intimates the same to 
TR(Lp) and then initiates the sfreaming of the Vreq from the selected PS(NBR(Lp)) 
to the TR(Lp) through this optimal path. Then the TR (Lp) in turn continue the 
sfreaming of Vreq to the requested PSq and the same is intimated to the requested PSq 
and hence the Wt (Vreq) and TCOST ({Si- Wi} ^^^'^ relatively higher, but acceptable 
with high QoS. 

Case 5: If the Vreq is not present in any of the PSs of its NBR(Lp) also, then the TR(Lp) 
decides to download the Vreq from CMS and stream to PSq. So Wt(Vreq) and 
TCOST({Si-Wi}™^ ) is maximum but very few requests Eire served from CMS. 
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Whenever the sufficient buffer and bandwidth is not available in the above operation the user 
request is rejected, which is a very rare possibility as shown by our simulation results. 

2.3 Simulation Algorithm: 

When a request for a video V (Vreq) arrives at a particular time t to PS, of Lp do the following: 

If (Vreq IS prcscut at PSq) 
{ 

Stream the Vreq to the user immediately from PSq 

} 

else 
{ 

Pass request to TR(Lp)-perform perfect Hashing 

If(VreqePS(Lp)) 

{ 

If (PS(Lp) is left or right neighbor to PSq) 
{ 

TR(Lp) initiates the Streaming of the 
Vreq to the user fi-om NBR(PSq) 
through PSq 

} 

else 
{ 

TR(Lp) streams the V^q to the user 
through PSq from PS(Lp) using 
the optimal path found. 

} 

} 

else 

{ 

Pass the request to left or right NBR (Lp) 

If(VreqePS(NBR(Lp))) 
{ 

TR(NBR(Lp)) Streams of the 

Vreq to the user from PS(NBR(Lp)) 

through TR(Lp) and PSq using the optimal path found 

} 

else 

{ 

TR(Lp) downloads and streams the 
Vreq to the uscr through PSq from CMS 
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} 

} 

} 

5. SIMULATION MODEL 



Our simulation model consists of a single CMS and a set of 6 TRs. All these TRs are 
interconnected among themselves in a ring fashion. Each of these TR is in turn connected to a set 
of 6 PSs. These PSs are also interconnected among themselves in a ring fashion. We use the video 
hit ratio (VHR) to measure the performance of our proposed approach more correctly. In addition 
we also use the average client waiting time, number of access to main server and the network 
transmission cost as performance metrics. 

We assume that the request distribution of the videos follows a zipf-like distribution and 
transmission delay between the proxy and the client as 100ms, transmission delay between proxy 
to proxy as 200ms, transmission delay between TR and PS as also 200ms, transmission delay 
between the main server and the proxy as 1200ms, transmission delay between tracker to tracker 
300ms, and the size of the cached video as 280MB to 1 120MB (25min - Ihr) in proportion to its 
popularity. 

6. SIMULATION RESULTS 



The simulation results presented below are an average of several simulations conducted on the 
model. Consider Fig.2, which shows the average number of blocks of videos served from various 
proxies of LPSG and NBR (Lp), and the average number of blocks of videos served from CMS 
which is only about 39%. 

As the maximum amount of most frequently asked videos have been cached and sfreamed 
from the LPSG with cooperation of PSs, and the coordination of TR of LPSG, Our scheme 
increases the video hit ratio and reduces the number of access to CMS by 30% -40% as shown in 
Fig 4 and Fig. 5. So the network fraffic along the server —client path reduces, in turn transmission 
cost and transmission time is also reduced significantly when compared to the system with single 
proxy as shown in Fig 3. More number of blocks of frequently requested videos are cached and 
shared among the proxies of Lp. So when there is a request for any of these i"* video, streaming 
starts from one of the PS immediately and hence the Wt (Vi) and TCOST ({Si- Wi} CMS) is very 
less. 

If the requested videos are present at NBR (PSq) of Lp, then these videos are streamed from 
NBR (PSq) to the client through PSq, so the clients Wt for these videos is very small. And if the 
requested videos are present in OTR (PSq), then these Videos are sfreamed from OTR (PSq) to 
the client through PSq, so the client's Wt for these videos is relatively higher. Otherwise also, 
some good number of videos are served from NBR (Lp), which reduces frequent downloading of 
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requested videos from CMS to the PSq which in turn reduces the initial playout delay at the clients 
for the requested videos which are not present at PSq. 

And a very few numbers of blocks of videos are streamed from the CMS, when the Vreq is 
neither present in that Lp, nor in NBR (Lp). Even though the initial startup delay and transmission 
cost seems to be more as shown in fig.6, it is acceptable because on an average 60%-80% of the 
videos are cached and streamed from Lp and NBR (Lp) by assuring high QoS as shown in Fig.2 
and Fig. 3 and only 40 %-20% of the videos are downloaded from CMS which reduced the number 
of access to CMS and hence the transmission cost and time reduced significantly. 
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Figure. 2 No. of Blocks of videos streamed from 
Lp+NBR(Lp), CMS Vs Time 
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Figure 6. Average waiting time for video from PS,NBR(PS), 
Lp,NBR(Lp) and MS 



7. CONCLUSION 

In this paper we have proposed an efficient video sharing mechanism and an architecture 
where proxies cooperate with each other to achieve reduced transmission cost and initial start up 
delay, by caching and streaming maximum portion of the most frequently requested videos among 
the proxies of Lp. Our simulation results demonstrated that our proposed approach has reduced the 
average network transmission cost of the system, initial startup delay for the videos requested at 
PSq, and also the load of CMS by caching maximum portion of the most popular videos in 
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proportion to their popularity across the PSs of Lp. And sharing of these videos among the proxies 
of the system also reduces the server-to-client transmission cost and time, maintains high QoS for 
the users and has removed the video redimdancy among the proxy servers. The future work is 
being carried out to improve the performance using dynamic buffer management at PS,. 
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